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Abstract 

In this paper, we set forth a new algorithm for generating approximately uniformly random 
spanning trees in undirected graphs. We show how to sample from a distribution that is within 
a multiphcative (1 + 5) of uniform in expected time 0{m^/n\ogl/S). This improves the sparse 
graph case of the best previously known worst-case bound of 0(min{mn, n^-^^^}), which has 
stood for twenty years. 

To achieve this goal, we exploit the connection between random walks on graphs and electrical 
networks, and we use this to introduce a new approach to the problem that integrates discrete 
random walk-based techniques with continuous linear algebraic methods. We believe that our 
use of electrical networks and sparse linear system solvers in conjunction with random walks and 
combinatorial partitioning techniques is a useful paradigm that will find further applications in 
algorithmic graph theory. 
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1 Introduction 



In this paper, we set forth a new algorithm for generating random spanning trees in undirected 
graphs. Random spanning trees are among the oldest and most extensively investigated proba- 
bilistic objects in graph theory, with their study dating back to Kirchoff's work in the 1840s |13j . 
However, it is only in the past several decades that researchers have taken up the question of how 
best to generate uniformly random spanning trees algorithmically. This question became an active 
area of research in the 1980s and 1990s, during which a long string of papers appeared that provided 
successively faster algorithms for this task (e.g., [HI H [7[ I8ll5l [Tl IT21I20] ) . 

Previous algorithms for this problem broadly fall into two categories: determinant-based algo- 
rithms and random walk-based algorithms. The starting point for the determinant-based algorithms 
was Kirchoff's Matrix Tree Theorem, which reduces counting the number of spanning trees in a 
graph to the evaluation of a determinant [13] (see, e.g., Ch.2, Thm. 8 in [4j). The first such algo- 
rithm produced a random spanning tree in time 0{mrfi) [111 I14j . After sequence of improvements 
([3 [6]), this line of research culminated in the algorithm of Colbourn, Myrvold, Day, and Nel [8j, 
which runs in the amount of time necessary to multiply two nx n matrices, the best known bound 
for which is 0(n2-376) [gj. 

The random walk-based algorithms began with the following striking theorem due to Broder 
[5] and Aldous [Ij: 

Theorem 1. Suppose you simulate a random walk in an undirected graph G = (y,E), starting 
from an arbitrary vertex s and continuing until every vertex has been visited. For each vertex 
V ^ V \ {s}, let e^ be the edge through which v was visited for the first time in this walk. Then, 
T = {e^ \v £ V \ {s}} is a uniformly random spanning tree of G. 

This immediately yields an algorithm for generating a random spanning tree whose running 
time is proportional to the cover time of G. If G has n vertices and m edges, the cover time can be 
Q{mn) in the worst case, but it is often much smaller. For sufficiently sparse graphs, this yields a 
better worst-case running time than the determinant-based algorithms. Since one clearly needs to 
see every vertex in G, it would seem unlikely that such methods could run in less than the cover 
time of the graph. However, in the last major breakthrough in this line of research, Wilson [20j 
showed that, by using a different random process, one could actually generate spanning trees in 
expected time proportional to the mean hitting time of the graph, which can be much smaller than 
the cover time (but has the same worst-case asymptotics). These algorithms generate an exactly 
uniform random spanning tree, but they remain the best known algorithms even if one wants to 
generate a spanning tree from a distribution that is within some multiplicative (1 + 5) of uniform. 
(We will call this a (^-random spanning tree; we shall define it more precisely in section [TTTl ) 

The worst-case running time bound of 0{mn) has stood for twenty years. In this paper, our 
main result is a new algorithm that offers a better worst-case time bound: 

Theorem 2. Let G be a graph with n vertices and m edges. For any S > 0, we can generate a 
6-random spanning tree of G in expected time 0{m^/nlogl/6). 

Beyond the classical applications of generating random spanning trees that motivated the orig- 
inal work on the problem, there have been some developments that further motivate their study. 
In particular, a recent paper of Goyal, Rademacher, and Vempala [10] showed how to use random 
spanning trees to generate efficient sparsifiers of a graph, and they then explained how this could 
be used to provide a scalable and robust routing scheme. 
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We believe that our techniques are of independent interest and may provide a good set of 
tools for the solution of other problems. On a broad level, we would like to highlight the use of 
electrical flows and linear systems to approach combinatorial questions about graphs. In addition 
to our work, they were recently used by Spielman and Srivastava [17], and Batson, Spielman and 
Srivastava [3] to provide much stronger sparsifiers than were previously available. These and the 
the present work illustrate two distinct ways in which electrical flows on a graph provide much 
richer information than that conveyed by just the graph spectrum. The fact that they may be 
found in nearly-linear time [18] provides a powerful tool for answering questions about graphs. To 
our knowledge, this is the first use of such techniques in combination with combinatorial ones to 
provide a faster algorithm for a purely combinatorial question. We believe that this is an exciting 
new frontier of algorithmic spectral graph theory that will find many further applications. 

In addition, it is interesting to note that our algorithm takes a graph problem for which fast 
matrix multiplication provides the best known methods for dense graphs and uses a method based 
on a sparse linear system solver to obtain a better running time for sparse graphs. There are a large 
number of graph problems for which fast matrix multiplication provides the best known running 
time. Our techniques suggest a general template for using the linear algebraic intuition underlying 
these algorithms in conjunction with sparse linear algebra routines to obtain faster algorithms on 
sparse graphs. 

1.1 5-random spanning trees and arborescences 

Formally, we consider the following problem: given an undirected graph G = iy, E) with n vertices 
and m edges, find a randomized algorithm A that, for each spanning tree T of G, outputs T with 
probability 1/|T(G)|, where T{G) is the set of all spanning trees of G. We will be interested 
in a slightly relaxed version of this problem in which we require A to output each tree T with 
a probability p{T) that is 6-far from the uniform i.e. (1 - 6)/\T{G)\ < p{T) < (1 + 6)/\T{G)\, 
for some parameter (5 > 0. We call this generating a 6-random spanning tree. We note that our 
algorithms' dependence on 5 occurs only because some of our algebraic manipulations are carried 
out only to finite precision. As such, we depend only logarithmically on 1 /5, not polynomially. 

For technical reasons that will arise later in the paper, it will be useful to consider arborescences 
in addition to trees. For a given s € G, an arborescence T rooted at s is a, directed spanning tree 
of G in which all vertices in G \ {s} have exactly one incoming arc. We use the notation ^(r) to 
denote the root of an arborescence, and we use eT{v), for any v £ G \ {r(T)}, to denote the unique 
arc incoming to f in T. 

We say that a procedure A generates a conditionally random arborescence if it outputs a vertex 
s and an arborescence T rooted at s such that the probability, conditioned on some s being a root, 
of outputting a particular arborescence T rooted at s is 1/|7^(G)|, where Ts{G) is the set of all 
arborescences in G rooted at 50 Note that in this definition we do not make any restrictions on the 
probabilities pa{s) that the generated arborescence output by A is rooted at s. Now, it is easy to see 
that, once we fix some s £ G, there is one-to-one correspondence between spanning trees of G and 
arborescences rooted at s. Indeed, given any spanning tree, there is a unique way of directing its 
edges to make it a valid arborescence rooted at s; conversely, given any arborescence rooted at s, we 
can obtain a spanning tree by just disregarding the direction of the edges. As a result, if we have a 

^For brevity, we will hereafter omit the word "conditionally" when we refer to such an object. We stress that this 
is an object that we are introducing for technical reasons, and it should not be confused with the different problem 
of generating a uniformly random arborescence on a directed graph. 
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procedure A that generates a random arborescence then, for a given spanning tree T, the probabihty 
that it will be generated is exactly Y1sggPa{s)/\%{G)\ = EseG^'A(s)/|'?'(G')| = 1/|T(G)|. This 
means that if we interpret the arborescence returned by ^ as a spanning tree then we get in this 
way a random spanning tree. By completely analogous reasoning, we get that a procedure A that 
generates 5-random arborescences gives us a procedure that generates (5-random spanning trees. 

1.2 An outline of our approach 

To describe our approach, let us consider a random walk X in G whose starting vertex is chosen 
according to the stationary distribution of G. If we just simulate X step- by-step until it covers the 
whole graph G, Theorem [T] asserts that from a transcript X(uj) of this simulation we can recover a 
random arborescence that is rooted at the starting vertex s{uj) of X{uj). However, while the whole 
transcript can have an expected length of 0(mn), we utilize only a tiny fraction of entries — the 0{n) 
entries that allow us to recover arcs 6^ for 'U G \ {5(cc')}. It is thus natural to wonder whether the 
generation of the whole transcript is necessary. The random walk may spend long periods of time 
walking around regions of the graph that have already been covered, which seems quite wasteful. 
One may ask whether it is possible to identify and avoid such situations by somehow skipping such 
unnecessary steps. That is, one may ask whether there is a way of shortcutting the walk X such 
that the corresponding transcripts are much shorter, can be generated efficiently, and still retain 
the information that we need to recover the desired arborescence. We note that this intuition is 
quite reasonable for many of the standard examples of graphs that have large cover time, which 
consist of regions that are covered very quickly but in which the walk spends much of its time. 

A tempting way to obtain such a shortcutting would be to try to just cut out from X all of its 
parts that correspond to visiting already explored parts of G. This shortcutting yields transcripts 
of length 0(n) and contains all of the information that we need. Unfortunately, it is not clear 
whether an efficient way of generating such transcripts exists — it is quite possible that the fastest 
way to find the next edge to an unvisited vertex traversed by the walk is to generate the whole 
trajectory of X inside the previously visited parts of G step- by-step. 

The core of our approach is showing that there indeed exists an efficient way of shortcutting X. 
On a high level, the way we obtain it is as follows. We start by identifying a number of induced 
subgraphs Di, . . . , of G such that the cover time of each Di is relatively small, and the set C of 
edges of G that are not inside any of the DiS constitutes a small fraction of the edges of G. Now, 
we shortcut the walk X in G by removing, for each i, the trajectories of X inside Di that occur 
after X has already explored the whole Djcl Such shortcutted transcripts clearly retain all of the 
information that we need. Moreover, we show that the fact that DiS have small cover time and C has 
very small size imply that the expected length is also small. Finally, by exploiting the connection 
between random walks and electrical flows together with the linear system solver of Spielman and 
Teng [18] , we provide an efficient procedure to approximately generate such shortcutted transcripts 
of X. All together, this yields the desired procedure for generating 5-random arborescences. 

1.3 Outline of the paper 

The outline of the paper is as follows. In section [21 we formally define the decompositions of G in 
which we are interested. We then relate the structure of the random walk X to these decompositions. 
Next, in section [3l we show how these ideas can be used to develop an algorithm that generates 

^Later we modify this procedure slightly to get better running time for dense graphs. 
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(conditionally) (5-random arborescence in expected time 0{m? /^ynlog 1/5). Finally, in section U we 
prove Theorem[2]by refining the previous algorithm to make it run in expected time 0(m-y/n log 1/5). 

2 The structure of the walk X 

In this section, we shall formally define the decomposition of G that will be the basis of our 
algorithm and prove several important facts about the structure of the walk X with respect to this 
decomposition. 

2.1 (0, 7)-decompositions 

While the algorithm sketched in the previous section could be made to use only edge cuts, our 
faster algorithm in section [J] will require both vertex and edge cuts. To facilitate this, we shall use 
a decomposition that permits us to keep track of a both a set of edges C and a set of vertices S in 
the cut. 

To this end, let {Di, . . . , Dk, S, C) denote a partition of G such that S C |Jj V{Di) = 

V{G) \ S, the Di are disjoint induced subgraphs of G, and C = E{G) \ |J^ E{Di) is the set of edges 
not entirely contained inside one of the D^. For a given Di, let U{Di) be the set of vertices of Di 
incident to an edge from C, and let G{Di) be the subset of C incident to Di. 

Definition 3. A partition {Di, . . . , D/., S, G) of G is a {(p, 7)-decomposition of G if: 

1. |C| <</.|ii;(G)|, 

2. for each i, the diameter "i{Di) of Di is less than or equal to 7, and 

3. for eachi, |C(A)| < |^(A)|. 

Note that this definition does not constrain the size of S explicitly, but \S\ is implicitly bounded 
by the fact that all edges incident to vertices of S are included in G. 

Intuitively, the above decomposition corresponds to identifying in G induced subgraphs Di, . . . , Dj. 
that have diameter at most 7 and contain all but a 4> fraction of edges of G. We bound the diam- 
eters of the DiS and ensure (by the third condition) that they are not too "spread out" in G, since 
these properties will allow us to prove that X covers each of them relatively quickly. We will do 
this using the approach of Aleliunas et al. [2] that proved that the cover time of an unweighted 
graph G' with diameter 7(G') is at most 0(\E{G')\-i{G')). 

For now, let us assume that we have been given some fixed ((/>, 7)-decomposition (Di, . . . , Dk, S, G) 
of G (for some 7 and (p that we will determine later) . In Lemma [TTl we will show that such decom- 
positions with good parameters exist and can be constructed efficiently. 

2.2 The walk X 

Let X = {Xi) be a random walk in G that is started at a vertex chosen according to the stationary 
distribution of G, where Xi is the vertex visited in the i-th step. Let r be the time corresponding 
to the first moment when our walk X has visited all of the vertices of G. Clearly, E{t) is just the 
expected cover time of G, and by the fact that G has m edges and diameter at most n, the result 
of Aleliunas et al. [2] yields 
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Fact 4. E[t] = 0{mn). 



Let Z be the random variable corresponding to the number of times that some edge from C 
was traversed by our random walk X, i.e., Z is the number of i < r such that Xi = v, Xi^i = v' , 
and {v,v') £ C. Since, by Fact HI the expected length of X is 0{mn), and we choose the starting 
vertex of X according to stationary distribution of G, the expected number of traversals of edges 
from C by X is just proportional to its size, so, on average, every m/|C|-th step of X corresponds 
to an edge from C. Therefore, the fact that in our decomposition |C| < (/>|-E(G)| implies: 

Fact 5. E[Z] = 0{(t>mn). 

Let Zi be the random variable corresponding to the number of times that some edge inside Di 
is traversed by our walk. By definition, we have that r = Yli + ^- Now, for each Di, let Tj be the 
time corresponding to the first moment when we reach some vertex in U{Di) after all the vertices 
from Di have been visited by our walk X. 

Finally, let be the random variable corresponding to the number of times that some edge 
from E{Di) is traversed by our walk X until time Tj occurs, i.e., until X explores the whole subgraph 
Di. The following lemma holds: 

Lemma 6. For any i, E[Z*] = 0(|^(A)|7(A))- 

Before we proceed to the proof, it is worth noting that the above lemma does not directly follow 
from result of Aleliunas et a/. (2]. The reason is that [2j applies only to a natural random walk in 
Di, and the walk induced by X in Di is different. Fortunately, the fact that |C(Z)i)| < \E{Di)\ 
allows us to adjust the techniques of ^ to our situation and prove that a bound similar to the one 
of Aleliunas et al. still holds. 

Proof: Let us fix D = Di. For a vertex v £ V{D), let dciv) be the degree v in G, and let d£,{v) be 
the degree of v in D. Clearly, dciy) > doiv), and dciy) = dciy) — doiy) is the number of edges 
from G incident to v. For u,v £ U{D), let p^^ be the probability that a random walk in G that 
starts at u will reach v through a path that does not pass through any edge inside D. 

Consider a (weighted) graph D' , which we obtain from D by adding, for each u,v £ U{D), an 
edge {u,v) with weight ddu) -Puv (^^l edges from D have weight 1 in D'. Note that we do not 
exclude the case u = v, so we may add self-loops.) By the fact that the Markov chain corresponding 
to the random walk X in G is reversible, dc{u) ■ p^^ = dciv) ■ p^ui so our weights are consistent. 

Now, the crucial thing to notice is that if we take our walk X and filter out of the vertices that 
are not from D, then the resulting "filtered" walk Yd will just be a natural random walk in D' (with 
an appropriate choice of the starting vertex). In other words, the Markov chain corresponding to 
the random walk in D' is exactly the Markov chain induced on V{D) by the Markov chain described 
by our walk X. As a result, since Z* depends solely on the information that the induced random 
walk Yd retains from X, it is sufficient to bound with respect to Yd- However, it is easy to see 
that, in this case, E[Z*] can be upper-bounded by the expected time needed by a random walk 
in D' , started at arbitrary vertex, to visit all of the vertices in D' and then reach some vertex in 
U{D). We thus can conclude that E[Z*] is at most twice the cover time of D' . (More precisely, it 
is the cover time plus the maximum hitting time.) 

Now, it is well-known (e.g., see [16j) that the cover time of any undirected graph G' is at 
most 2log\V{G')\Hjnax, where Hmax{G') is the maximal hitting time in G' . Our aim is therefore 
to show that Hmax{D') = 0{\E{D)\-i{D)), where 7(L>) is the diameter of D. We will do this 
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by applying the approach of Aleliunas et al, who proved in [2] that for an unweighted graph G' , 
HmaxiG') < \E{G'MG'). 

To achieve this goal, let K be some number such that all weights in D' after multiplication by 
K become integral (which we can do since all of the weights in D' have to be rational). Let K ■ D' 
be the unweighted multigraph that we obtain from D' by multiplying all of its weights by K and 
then interpreting the new weights as multiplicites of edges. Note that the natural random walk in 
K ■ D' (treated as a sequence of visited vertices) is exactly the same as in D' . 

Now, we prove that for two vertices v and w of D' such that an edge (v, w) exists in D, the 
expected time H^^^^K ■ D) = H^^^^D') until a random walk in K ■ D' that starts at v will reach 
w is at most 4:\E{D)\. To see this, note that the long-run frequency with which an copy of an 
edge is taken in a particular direction is 1/(2M), where M is total number of edges of K ■ D' (and 
we count each copy separately). Thus one of the K copies of edge {v,w) is taken in the direction 
from V to w every i^/(2M)-th step on average. This in turn means that Hy^yj{D') < 2M/K. Now, 
to bound M, we note first that M < K{\E{D)\ + J2uveU{D) dG{u)Pu,v)- Thus, since for a given 
u G U{D) the probability '^^p^y of going outside D directly from u is equal to dc{u)/dG{u), we 
obtain that M < K{\E{D)\ + J^^dciu)) < 2K\E{D)\ by the property of a (0, 7)-decomposition 
that requires that \C{D)\ < \E{D)\. We can thus conclude that H^^^iD') < 2M/K < 4:\E{D)\, 
as desired. Having obtained this result, we can use a simple induction to show that H^^^^D') < 
4\E(D)\A{v,w), where A{v,w) is the distance between v and w in D. From this, we can conclude 
that Hmax{D') = A\E{D)\-f{D) and E[Z*] < 16 log n|^(A)|7(A), as desired. 

I 

3 Our algorithm 

Let us now focus our attention on some particular Di from our (</), 7)-decomposition of G. The idea 
of our algorithm is based upon the following observation. Suppose we look at our random walk X 
just after time Tj occurred. Note that this means that we already know for all v £ V{Di) which 
arc we should add to the final arborescence. Therefore, from the point of view of building our 
arborescence, we gain no more information by knowing what trajectory X takes inside Di after 
time Tj. More precisely, if, at some step j, X enters Di through some vertex v G V{Di) and, after k 
steps, leaves through some edge (u, u') E C, where u £ V{Di) and u' ^ V{Di), the actual trajectory 
Xj, . . . ,Xjj^k does not matter to us. The only point of simulating X inside Di after time Tj is to 
learn, upon entering Di through f , through which edge (u, u') G C we should leave. 

Let Pv{e) be the probability of X leaving Di through e after entering through vertex v. If we 
knew Pv{e) for all v £ V{Di) and all e G C{Di), then we could just, upon entering v, immediately 
choose the edge e through which we will exit according to distribution ^^(e) without computing 
the explicit trajectory of X in Di. That is, if we consider a shortcutting X of X that cuts out from 
X all trajectories inside Di after it was explored, then Pv{e) would be all that we need to simulate 
X in time proportional to the length of X (as opposed to the length of X). 

Now, the point is that we can efficiently compute an e-approximation of Pvie), as we will show in 
section [3TT| which will enable us to compute this shortcutting to fairly high precision. Furthermore, 
as we will see shortly, the analysis of structure of X that we performed in the previous section shows 
that the computation of these 'unnecessary' trajectories constitutes the bulk of the work involved 
in a faithful simulation of X, and therefore X has much smaller length while yielding the same 
distribution on random arborescences. 
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To formalize the above intuitions, let X = (Xi) be a random walk obtained in the follow- 
ing way. Let X{u)) = Xq{lj), . . . X^(^^^^{u) be some concrete trajectory of X, and let X{lj) = 
. . . , be decomposition of X{lo) into contiguous blocks Xj{uj) that are contained 

inside Di. for some sequence ij G {0, ... ,A;}, where we adopt the convention that Dq = S. We 
define X{u;) as follows. We process X{uj) block-by-block and we copy a block Xj{uj) to X(a;) as 
long as Ti- has not occurred yet or ij = 0, otherwise we copy to -'^(i^) only the first and last entries 
of the block. (We shall refer to the latter event as a shortcutting of the block.) We proceed now to 
the formal proofs of the properties of X stated above. 

We start by showing that X can indeed be simulated efficiently given probabilities Pv{e) and 
some preprocessing time that will later be amortized into the overall running time of the algorithm: 

Lemma 7. Knowing Pv{e) for all e G C{Di), v G V{Di) and i, we can preprocess these values in 
0{(f)mn) time in such a way that it allows simulation of I steps of X in time 0{l). 

Proof: Simulating X before any Tj has occurred is straightforward. The only thing to show is that 
we can implement the shortcutting of the blocks, for which it would suffice to show that, using the 
Pu(e), we can sample an edge e from this distribution in polylogarithmic time. To see how this 
can be done, let us fix some i and some ordering ei, . . . , e|(7(£)^)| of the edges from C{Di). For each 
V G U{Di), we can create in 0{\C{Di)\) time an array Av{i) where Ay{i) = J2i<j<i ^vi^j), so we 

can do it for ah v G f/(A) in total time at most d{\V{Di)\ ■ |C(A)|)- 

Now, if we want to choose e according to P^(e), we choose a random number from r G [0, 1], 
use binary search to find an entry j in Ay{i) such that Ay{j) > r > Ay{j — 1), and output e = ej. 
Summing up the total preprocessing time for all Di yields the desired bound. I 

We are ready to prove the following lemma: 

Lemma 8. Given an {(p,^)- decomposition of G and the probabilities Pv{e), we can find a random 
arborescence of G in expected time 0{m{"f + (pn)). 

Proof: By the discussion above, it suffices to simulate the algorithm described in Theorem [1] using 
the walk X, so we just need to bound the expected number of steps it takes for X to cover the 
whole graph G. To do this, we upper-bound the expected length of the covering walk X after 
shortcutting it. However, this quantity is just E[Z^] + 3E[Z], since the two vertices that remain 
in X after shortcutting some block from X can be amortized into the number of traversals by X of 
edges from C. By Fact [5]and LemmaH we get that Y.i E[Z*] + 3E[Z] = d{Y,i \E{Di)\-f + (j)mn) = 
0(m(7 -|- (j)n)). By Lemma [TJ we can simulate a walk of this length in expected time 0(m(7 -|- (pn)). 
I 

In order to complete our algorithm, we thus need to show three things: we can quickly compute 
the Pv{e), a {4>, 7)-decomposition exists with good parameters, and we can find such a decomposition 
efficiently. 

3.1 Computing Pv{e) 

In the following lemma we address the first issue. 

Lemma 9. Given a (cj), ^) -decomposition ofG, we can compute multiplicative (l+e) -approximations 
of all of the Pv{e) in time 0{(pm? log 1/e). 
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Proof: Let us fix some D = Di and an edge e = {u,u') G C{D) with u G U{D). Consider now a 
graph D' that we obtain from D as follows. First, we add vertex u', and some dummy vertex u* to 
D, and then, for each {w,w') £ C{D) \ {e} with w G U{D), we add an edge (it;,u*). (Note that w' 
can be equal to u' .) Finally, we add the edge e = {u,u'). The crucial thing to notice now is that 
for any given vertex v £ D, Pv{e) is exactly the probability that a random walk in D' started at v 
will hit u' before it hits u* . We can compute such probabilities quickly using electrical flows. 

More precisely, (see, e.g., [16]) if we treat D' as an electrical circuit in which we impose voltage 
of 1 at u' and at u* , then the voltage achieved at v in such an electrical flow is equal to Pv{e)- 
We can compute a (1 + e)-approximation of such a flow in time 0{\E{D')\ log using the linear 
system solver of Spielman and Teng [18]. To do so, let L be the Laplacian of D\ where we let 
the first two rows correspond to u' and u*, respectively. Furthermore, let iext S rI^^-^'^I be the 
vector that is 1 in its first coordinate, —1 in its second coordinate, and zero everywhere else. Let 
v' G be the solution to the linear system Lv' = \ext, which we can approximately find in 

nearly- linear time using the solver from [18]. We obtain our desired vector of voltages v* from v' 
by subtracting V2 from all of its coordinates and dividing them by v'^ — v'^- It is worth noting that 
this approach to computing electrical flow was also used in Spielman and Srivastava [17J. 

Our algorithm computes such voltages for each edge e. (Note that we might have to compute 
two such voltages per edge — one for each endpoint of e — if they each lie in different From each 
flow, we store the probabilities Pvie) for all vertices v that we are interested in. The running time 
of such procedure is bounded by 0(|C| \E{I)i)\ log 1/e) = O^cpm? log 1/e), where we use the fact 
that for each D, \E{D')\ = \E{D)\ + \C{D)\ < 2\E{D)\ by the deflnition of a ((/>, 7)-decomposition. 
I 

Since the linear system solver only gives us approximate values for the Pvie), we need to show 
that we can control the overall error while still maintaining a good running time. We do this with 
the following lemma: 

Lemma 10. Given a {(j),^)- decomposition of G and multiplicative {1 + e)- approximations of all of 
the probabilities Pv{e), we can generate a 6-random arborescence of G in expected time 0{m{^ + 
(j)n)), as long as e < 5/mn. 

Proof: 

We deflne a new random walk X that approximates X as follows. The walk X simulates X 
for the first mn steps, except it uses the approximate values of the Pv{e) when it shortcuts a block. 
After the mn^^ step, it shortcuts blocks using the exact values of the P^(e). 

We start by noting that, for any t > and trajectory X{(jj)^ 

Pr[Xo = X(a;)o, ...,Xt = X{uj)t]/{1 + 5) 

<Pr[X; = X(u;)o,...,i; = X(u;)i] (1) 
<{l + 5) Pr[Xo = X(a;)o, ...,Xt = X{uj)t]. 

That is, for any trajectory X{lv)^ the probability that its first t entries appear as a prefix of 
the walk X is within & {1 + 5) multiplicative factor of the probability that this preflx appears as 
a prefix of the walk X. 

Indeed, each of the first mn steps of the walk X distorts the likelihood of appearance of a 
particular sequence of vertices by a multiplicative factor of (1 + e). Thus, since (1 + e)™-" < 
1 + emn < 1 + (5, the assertion follows. 
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It is easy to see that ([T|), together with Theorem [T] and the fact that the walk X is a shortcutting 
of the walk X, implies that simulating the walk X until it covers the graph G is sufficient to obtain 
a (5-random arborescence. Therefore, what is left to prove is that the simulation of the walk X 
until it covers the graph G can be done in expected time 0{m{'y + (pn)). 

We do this as follows. For the simulation of the first mn steps, we use the method from Lemma 
[7] with the (1 + e)-approximations of the probabilities P^(e). If the walk hasn't covered the graph 
by step mn, we continue the simulation of X by simply generating the entire transcript of the walk 
X until it covers the graph and then shortcutting it accordingly. 

To bound the running time of this procedure, we note that ([T]) implies that the expected number 
of steps necessary for X to cover G is at most {1 + 6) times the expected number of steps required 
by X, which, by Fact[5]and Lemma[6l is E[Z*] + 3E[Z] = 0{m{'y -\- (pn)). Therefore, by Lemma 
[71 we get that the part of the simulation that deals with the first mn steps runs in expected time 
0((1 + 6) Y^ - E[Z*] + 3ii^[Z]) = 0(m(7 + 4>n)), since we can always assume that S < 1. 

By the Markov inequality, we know that the probability that X ever takes more than mn steps 
is at most 0{m{-y + <j)n))/mn. We can bound the expected running time of the second part of 
the simulation by this probability multiplied by the the expected cover time of the whole graph, 
which is 0{mn). This gives the total expected running time of the simulation to be 0{m{'y -\- (pn) + 
m'^n{'~f + (pn)/mn) = 0(m(7 + (pn)), as desired. 

I 

We now proceed to the final ingredient of our algorithm — finding good (</), 7)-decompositions 
quickly. 

3.2 Obtaining good (0, 7)-decompositions quickly 

By using the ball-growing technique of Leighton and Rao [15] , we obtain the following: 

Lemma 11. For any G and any (p = o{\), there exists a {(p, 0{l/(p))- decomposition ofG. Moreover, 
such a decomposition can be computed in time 0{m). 

We omit the proof, as we shall prove a stronger statement in Lemma [T3l 

3.3 Generating 5-random arborescence in expected time 0{m'^/^/n log 1/6) 

We can now put these results together to generate ^-random arborescences in expected time 
0(m^ /^/nlogl/6). Note that this bound is worse than the one stated in Theorem [2] — we shall 
improve it to obtain the better time bound in section [H 

Let (p = l/n^^"^, and let e = 5/mn, as in Lemma[ini By Lemma[TT| we can get a Oin}^"^))- 
decomposition of G in 0{m) time. By LemmalU we can compute the estimates of Pv{e) for all rele- 
vant V and e in time 0(m^/y^log 1/e) = 0(m^/y^log 1/5). Having done this, we can use Lemma 
[8] (together with Lemma [T0|) to generate a (5-random arborescence in time 0(m^/y^log 1/6). 

4 Obtaining an 0{m^/nlogl/6) running time 

The bottleneck in the procedure presented above is the computation of probabilities Pv{e) — 
everything else can be done in time 0{m^/nlogl/6). Unfortunately, it is not clear how we could 
improve the running time of these computations. To circumvent this problem, we will alter our 
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procedure to use slightly different probabilities and a slightly different random walk that will end 
up yielding a faster simulation time. 

To introduce these probabilities, let us assume that there are no edges in G between different 
DiS (which we will ensure to be the case), and let Ci{u) for a vertex n G 5 be the set of edges 
incident both to u and the component Dj. Now, for a given some v G V{Di), and u & S with 
|Cj(ii)| > 0, we define Qv{u) to be the probability that u is the first vertex not in V{Di) that 
is reached by a random walk that starts at v. We will use these probabilities to simulate a new 
random walk X that we now define. For given trajectory X{uj) of walk X, X{ij) is equal to X{u;) 
(as defined before), except that whenever X(ij) shortcuts some block visited by X, X{uj) contains 
only the first vertex visited in this block, as opposed to both first and last vertices retained in 
X{uj). X is a shortcutting of X and is thus a shortcutting of X as well. 

It is not hard to see that by using Qv{u) in a way completely analogous to the way we used Py(e) 
before, we can simulate the walk X efficiently, and the expected length of this walk is bounded 
by the expected length of the walk X. However, unlike X, X does not necessarily posess all of 
the information needed to reconstruct the final arborescence. This shortcoming manifests itself 
whenever some u is visited for the first time in X directly after X entered some -D, after has 
already occurred. In this case, we know that the corresponding trajectory of the walk X visited u 
for the first time through some edge whose other end was in Di (and thus we should add it to our 
arborescence as e^), but we don't know which one it was. 

To deal with this problem, we will define a stronger decomposition of G whose properties will 
imply that the above failure to learn e„ will occur only for a small number of vertices u. Then, at 
the end of our simulation of the walk X, we will employ a procedure that will compute the missing 
arcs in a manner that will not distort the desired distribution over output arborescences. 

We proceed now to formalizing the above outline. 

4.1 Strong (0, 7)-decompositions 

The number of probabilities Qv{u) that we need to compute now depends on the number of vertices 
that are connected to the DiS through some edge from C, rather than just the number of edges of 
C. To control this, we introduce the following stronger graph decomposition: 

Definition 12. A partition {Di, . . . , Dk, S, C) of G is a strong ((/>, 7)-decomposition of G if: 

1. {Di, . . . , Dk, S,C) is a ((j),j)-decomposition, 

2. there are no edges between different DiS (i.e. S is a vertex multiway cut), and 

3. \C{S)\ < (p\V{G)\, where C{S) is the set of vertices from S that are connected by an edge to 
some Di. 

We prove the following: 

Lemma 13. For any G and any (p = o(l), there exists a strong {(j),0{l/(j)))- decomposition of G. 
Moreover, such a decomposition can be computed in time 0{m). 

Proof: We will use the ball-growing technique of Leighton and Rao [L5l as presented by Trevisan 
[19] . For a graph H, let BH{v,j) be the ball of radius j around the vertex v in H, i.e., let Bnivjj) 
consist of the subgraph of H induced by all vertices in H that are reachable by a path of length at 
most j from v. Furthermore, let R}{{v,j) be the set of vertices that are at distance exactly j from 
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Table 1: A procedure finding a strong ((/>, O (!/(/>) )-decomposition of G 



• Set = G, 5 = 0, C = 0, D = {}, i = 1 and t = - </>) 




• While / 




• (* Ball-growing *) 




— Choose an arbitrary v £ H, set j = 




- (*) As long as \RH{v,j + 1)| > t\V{BH{v,j))\, \Rjj{v,j + 1 
\^H\^^i + 1)1 > ^^\Bn\v,3))\: 


)\ > t\EiBHiv,j))\ or 


- j=j+l 




— Let ji be the j at which the above loop stops. Add RH{v,ji 
edges incident to RH{v,ji + 1) (i.e. R'jj{v,ji + 1) U Rjj{v,j 
Bniv^ji) as component Di to D. 


+ 1) to S, add all the 

, + 1)) to G, and add 


• output the resulting partition {Di, . . . , Dk, S, C) of G 





V in H. Finally, let R^{v,j) {R^{v,j) respectively) be the set E{BH{v,j + 1)) \ E{BH{v,j)) (the 
set E{BH{v,j)) \ E{BHiv,j - 1)) respectively). 

Consider now the procedure presented in Table [TJ First, we note that this procedure can be 
implemented in nearly-linear time, since each edge is examined at most twice before it is removed 
from H. Moreover, an elementary charging argument shows that, in the resulting partition, \G\ < 
(1/(1 + l/t))\E{G)\ = (t>\E{G)\, and similarly \G{S)\ = \S\ < 4>\V{G)\. By construction, there are 
no edges between distinct D^s. We want to argue now that for all i, ji < 3(l-|-log |ii^(G)|/ log(l-l-t)), 
which in turn would imply that all of the DjS have diameter at most 6(1 + log |-E'(G)|)/log(l + t) = 
6(1 -Flog |^(G)|)/log(l/(l - (/)))) = 0(logm/(-log(l - (j)))) = 0{logm/(j)), where we used Taylor 
expansion of log(l — x) around x = to get this estimate. To see why the above bound on ji holds, 
assume that it was not the case for some i and v. Then, during the corresponding ball-growing 
procedure, a particular one of the three conditions from (*) must have been triggered more than 
ji/3 = 1 + log|^(G)|/log(l + t) times. If this condition was {RnivJ + 1)| > t\V{BHiv, j))\, 
then, since we never remove vertices from our ball that is being grown and Bh iv,0) has one 
vertex, the final ball BH{v,ji) has to have at least (1 + ty^^^ > \E{G)\ > |^(G)| vertices, which 
is a contradiction. Similarly, if iR^ivJ + 1)| > t\E{BH{v,j))\ {\R]^{v,j + 1)| > t\E{BH{v, j))\ 
respectively) was the condition in question, then \E{BHiji,v))\ > (1 + ty^^^ > \E{G)\, which is a 
contradiction as well. Thus we may conclude that the above bound on ji holds, and all DiS have 
diameter O(l/0), as desired. 

At this point, we know that the partition of G that we obtained satisfies all of the properties of 
a strong ((/), 0(l/(/>))-decomposition except possibly the one that asserts that there is no Di such 
that \E{Di)\ is smaller than the number |C(I?j)| of edges from G incident to Di. However, if such 
Di exist, then we can just add them to our cut, i.e., we add V{Di) to S, and E{D,j) to G. Note that 
the size of G can at most triple as a result of this operation, since an edge that is initially in G can 
be incident to at most two of the Di, and edges E{Di) are by definition not incident to any other 
Dj. Similarly, G{S) does not increase as a result of adding V{D,i) to S. We may therefore conclude 
that the decomposition returned by this algorithm is indeed a strong (0, 0(l/(/>))-decomposition of 
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G. 

I 

From now on we fix some strong {(f), 7)-decomposition of G. 

4.2 Computing Qv{u) 

By using an approach similar to the one that we used when computing the Pyie) values, we get the 
following lemma. 

Lemma 14. Given a strong {(p,^)- decomposition of G, we can compute multiplicative (1 + e)- 
approximations of all Qv{u) in time O {(j)mn log 1/e). 

Proof: Let us fix some D = Di, let S' be the set of vertices in w £ S with |Cj(u))| > 0, and let 
us fix some u € 5'. Consider now a graph D' that we obtain from D U S' hy merging all vertices 
in S' \ {u} into one vertex called u* . For any given vertex v £ D, Qv{u) is exactly the probability 
that a random walk in D' started at v will hit u before it hits u* . We can quickly compute such 
probabilities using electrical fiows. 

More precisely, (see e.g. |16]) if we treat D' as electrical circuit in which we impose voltage of 
1 at u and at , then the voltage achieved at v in such electrical flow is equal to Qy (e) . We can 
compute a (1 + e)-approximation of such a flow in time 0{\E{D')\ log 1/e) using the linear system 
solver of Spielman and Teng [18j. To do so, let L be the Laplacian of -D', where we let the first 
two rows correspond to u and u*, respectively. Furthermore, let \ext S M'^^^ be the vector that 
is 1 in its first coordinate, —1 in its second coordinate, and zero everywhere else. Let v' be the 
solution to the linear system Lv' = \ext-, which we can approximately find in nearly-linear time 
using the solver from [18]. We obtain our desired vector of voltages by subtracting V2 from all if 
its coordinates and dividing them by v'l — ^2- 

Our algorithm computes such voltages for each u € S and all Di with |Ci(n)| > 0, and from 
each such computation it stores the probabilities Qv{u) for all vertices v S V{Di) that we are 
interested in. The running time of such procedure is bounded by 0{\C{S)\ \E{Di)\ log 1/e) = 
0((/)mn log 1/e), as desired. I 

4.3 Coping with shortcomings of X 

As mentioned above, the walk X can be simulated more efficiently than the walk X, but it does 
not have all the information needed to construct the arborescence that would be generated by the 
walk X that X is meant to shortcut. This lack of information occurs only when some vertex u 
is visited for the first time by X immediately after X visited a component Di after time Tj has 
already occurred. Note that, by the properties of the strong (0, 7)-decomposition that we are using, 
it must be the case that u G C{S) and |C(S')| < (j)n. This shows that X fails to estimate the arcs 
eu for a small fraction of vertices of G. We prove now that, in this case, these missing arcs can be 
reconstructed efficently and in a way that preserves the desired distribution over arborescences. 

Lemma 15. For a trajectory X{uj) that starts at vertex s, let F{X{uj)) be the set of arcs Cy, for 
V ^ G{S) U {s}, corresponding to this trajectory. Let F{X{ui))* be the set of all arborescences H 
rooted at s such that eniv) = Cy for v ^ C{S) U {s}. Then, given F{X{lo)), we can generate a 
random arborescence from F(X{lo))* in time 0{m + ((/>n)^-^'^^). 
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Proof: For brevity, let us define F := F{X{uj)). Now, let Hi, . . . , Hr, with s E Hi, be the 
decomposition of F into weakly connected components. We construct a directed graph G{F, s) as 
follows. G{F, s) has a vertex /i-,- for each Hj. E{G{F, s)) is the set of all arcs {hj, hi) such that there 
exists v,u €z G with v £ Hj, u £ Hi and u S G{S) \ {s}. By definition, if H' is an arborescence in 
G{F, s) rooted at hi, then H' D F is an arborescence in G rooted at s and H' U F £ F*. Moreover, 
for any arborescence H £ F* , H' = {e/f(u) | v £ C{S) \ {s}} is an arborescence in G{F, s) rooted at 
hi. So, if we use the algorithm of Colbourn et al. [8j to generate random arborescence H' in G{F, s) 
rooted at hi, then ff' U F is a random arborescence from F* . Since \V{G{F,s))\ = |C(5)| < (j)n 
and the algorithm from [8] works in time 0{\V {G{F, s))\^''^'^^), the lemma follows. 

I 

4.4 Proof of Theorem [2] 

By the connection explained in section 11.11 it is sufficient to devise a procedure that generates 
5-random arborescences. We do this as follows. We fix = l/^/n, and, using Lemma [T3l we get 
a strong (l/-^/n, 0(-yn))-decomposition of G in 0{m) time. Now, using Lemma [HI we compute £- 
approximations of all of the probabilities Qv{u) in time 0(m\/n log 1/e) = 0{m^/n\ogl/ 5), where 
we set e = 0{5/rV'). At this point, we can use completely analogous reasoning to that used in 
Lemmas [8] and Lemma [TO] to prove that we can simulate (^-approximately) the walk X in expected 
time 0{m^/n) (where we use in particular the fact that X is a shortcutting of X). Having done 
this, we look at the directed forest F consisting of arcs e„ for v ^ C{S) U {s} as defined by our 
simulation of X . We then use the procedure from Lemma [TSl to get an arborescence T of G in time 
0(m + n^'^^^/^) = 0{m^/n). The whole algorithm therefore runs in expected time 0(m-^/n log 'VjS). 

To see that the above algorithm generates a (conditionally) 5-random arborescence, let us 
consider some arborescence T of G rooted at some vertex s and condition what will follow on the 
event that the algorithm outputs an arborescence rooted at s . Let -F(r) = {eT{v) \ v ^ C(5)U{s}}. 
By the fact that X is a ((^-approximate) shortcutting of the walk X and by Theorem [H we know 
that F{X{uj)) = F{T) for at least a (1 - 6)\F{Ty\/\%{G)\ and at most a (1 + 6)\F{Ty\/\Ts{G)\ 
fraction of trajectories, where F{T)* is the set of all arborescences compatible with F{T) in the 
sense of Lemma [T5j Since the procedure from Lemma [15] generates a random arborescence from 
F(T)*, T is generated with probability at least (1 — 6)/\Ts{G)\ and at most (1 -|- 6)/\Ts{G)\. This 
concludes the proof of Theorem [2j 
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